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PROBE 

The present invention relates to DNA probes for 
detecting a tandemly-r epeated nucleotide sequence in the 
gene encoding mucin glycoprotein expressed by human 
mammary epithelial cells, to the use of the probe in 
diagnosis and in "fingerprinting" individuals, to the 
polypeptides expressed by the corresponding mucin gene, to 
antibodies against the polypeptides and to the use of the 
polypeptides and antibodies in the diagnosis and 
therapeutic treatment of cancer. 

Normal and malignant human mammary epithelial 
cells express high molecular weight glycoproteins (gps) 
which are extensively glycosylated and very antigenic. As 
a result, many of the monoclonal antibodies (MAbs) selected 
for reactivity with human breast cancer and other 
carcinomas are found to react with molecules which are 
produced in abundance by the fully differentiated human 
mammary tissue and are found in the milk fat globule (MFG) 
and in milk. However, the level of expression of a 
particular antigenic determinant may be different in the 
gps produced by the normal differentiated cell and in the 
similar molecules produced by breast cancers. This means 
that some antibodies can show a certain specificity for 
reactin g wi th _tum_ojir_gps • 
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The molecules bearing the epitopes recognised by 
these antibodies are complex and have been difficult to 
analyse r both because they are large and heavily 
glycosylated {>250r000 daltdns) and because of the complex 
pattern of expression. Two of the MAbs, HMFG-1 and -2^ 
react with a component in human milk which appears to be 
greater than 400^000 daltons, whereas the molecules found 
in sera and tumours are smaller, although the dominant 
components are still greater than 200,000 daltpns on 
immunoblots- The large glycoprotein produced by the 
differentiated mammary epithelial cells found in human milk 
or in the milk fat globule has. been purified and shown to 
have some of the characteristics of the mucins. This 
component -contains a large amount of carbohydrate joined in 
0-linkage to serine and threonine residues via the linkage 
sugar N-acetylgalactosamine. Moreover, the core protein 
contains high levels of serine, threonine and proline and 
low levels of aromatic and sulphur containing amino acids. 

These mucin-like glycoproteins are also secreted 
by a number of other . normal epithelial cells. The 
monoclonal antibody HMFG-1 is highly reactive with the milk 
mucin and evidence suggests that the epitope recognised by 
this antibody is more abundant on the fully processed 
mucin, characteristic of normal differentiation. 

In tumours, the molecular weight of the molecules 
carrying these antigenic determinants differs among 
individual "tumour's and , in the case of the components 
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recognised by the HMFG-2 antibody, can range from 80-400K 
daltons. Although it appears that the differences observed 
. in the mobility of the high molecular weight bands are due 
to genetic polymorphism this probably does not explain 
variations in the size of the lower bands. It has been 
proposed that these may be the result of aberrant 
processing occurring in the tumour cell possibly within the 
glycosylation pathways • 

For the majority of the monoclonal antibodies 
reacting with this group of molecules the exact nature of 
the antigenic epitopes remains unclear but circumstantial 
evidence has suggested that carbohydrate may at least be 
partly involved in many of the epitopes. Moreover, from 
previously available data it was not known whether the 
mucin found in the normal differentiated cells, and that 
observed in the tumours, contain the same core protein, or 
just carry common carbohydrate determinants. 

Mucin has now been isolated from human milk by 
affinity chromatography enabling identification of the core 
protein and the gene encoding the protein. This has been 
found to be a highly polymorphic gene defined by the peanut 
urinary mucin (PUM) locus [see Swallow et al . , Disease 
Markers , 4^ 247, (1986) and Nature , 327 , 82-84 (1987)] • 
The gene product, which is hereafter referred to as human 
polymorphic epitlielial mucin or HPEM, has been detected in 
breast tumours and other carcinomas as well as in some 
normal epithelial tissues. 
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It has now been found that the HPEM core protein 
has epitopes which also appear in the aberrantly processed 
gps produced by adenocarcinoma cells. Certain of these 
epitopes are not exposed in the fully processed mucin, 
glycoprotein produced by the lactating mammary gland. 

In one aspect the present invention therefore 
provides an antibody against a human mucin core protein 
which antibody substantially does not react with a fully 
processed human mucin glycoprotein. 

As used herein the term "antibody" is intended to 
include fragments of antibodies bearing antigen binding 
sites. such as the FCab')^ fragments. 

Antibodies according to. the present invention 
react with HPEM core protein, especially as expressed by 
colon, lung, ovary and particularly breast carcinomas, but 
have reduced or no reaction with the corresponding fully 
processed HPEM, In a particular aspect the antibodies 
react with HPEM core protein but not with fully processed 
HPEM glycoprotein as produced by the normal lactating human 
mammary gland. 

Antibodies according to the present invention 
have no significant reaction with the mucin glycoproteins 
produced by pregnant or lactating mammary epithelial 
tissues but react with the mucin proteins expressed by 
mammary epithelial adenocarcinoma cells. These antibodies 
show a much reduced reaction with benign breast tumours and 
-are- theref ore" use'ful" i"n "the "diargnosi"s ~ahd~ localisation of 
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breast cancer as well as in therapeutic methods. 

The antibodies may be used for other purposes 
including screening cell cultures for the polypeptide 
expression product of the human mammary epithelial mucin 
gene, or fragments thereof, particularly the nascent 
expression product. In this case the antibodies may 
conveniently be polyclonal or monoclonal antibodies. 

Antibodies according to the present invention may 
be produced by innoculation of suitable animals with HPEM 
core protein or a fragment thereof such as the peptides 
described below. Monoclonal antibodies are produced by the 
method of Kohler & Milstein (Nature 256, 495-497/1975) by 
immortalising spleen cells from an animal innoculated with 
the mucin core protein or a fragment thereof, usually by 
fusion with an immortal cell line (preferably a myeloma 
cell line) , of the same or a different species as the 
innoculated animal, followed by the appropriate cloning and 
screening steps. 

In a particular aspect the present invention 
provides the monoclonal antibodies designated SMS against 
the HPEM core -protein. In another aspect the invention 
provides the hybridoma cell line which secretes the 
antibodies SM3 and has been designated HSM3. Samples of 
HSM3 have been deposited with ECACC on 7th January 1987 
under accession number 87010701. 

Using antibodies according to the invention it 
has- been-possible- to screen--a -phage -li-brary-constructed 
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from mRNA isolated from a human breast cancer cell line to 
identify sequences coding for portions of the mucin core 
protein • Complementary DNA. sequences have been constructed 
and from these it has surprisingly been found that the gene 
encoding the core protein contains multiple tandem repeats 
of a 60 base sequence leading to considerable polymorphism 
sufficiently extensive that cDNA fragments corresponding to 
the repeat sequence would be useful for fingerprinting DNA. 
The fingerprinting thus made possible has applications in 
for instance ascertaining whether bone marrow growth after 
transplants is from the host or the. donor and in forensic 
medicine for identifying individuals using body tissues or 
fluids. 

Accordingly the present invention also provides a 
nucleic acid fragment comprising at least 17 nucleotide 
bases the fragment being hybridisable with at least one of 

a) the DNA sequence 

. 5 ' * 
ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 

GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC- 

* 

ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 

3' 

GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC 

b) DNA complementary to the DNA of a) , i.e. of sequence 

5' 

GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 
* 

CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT- 

GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 
* 3* 
CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT_ _ - 
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c) RNA having a sequence corresponding to the DNA sequence 
of a) and 

d) RNA having a sequence corresponding to the complementary 
DNA sequence of b) . 

The sequences in (a) and (b) each include a double 
tandem repeat sequence of 120 bases. Fragments according to 
the invention may correspond to any portion of this sequence 
including portions bridging the start point of the repeat. 

Fragments according to the invention will 
hybridise under conditions of low stringency with the DNA 
and RNA sequences (a) to (d) above. Preferred fragments are 
those which also hybridise under conditions of high 
stringency. The most preferred fragments of the invention 
are those which have sequences exactly identical to, or 
exactly complementary to the sequences (a) to (d) above. 

Normally a given DNA or RNA fragment according to 
the invention will be capable of hybridising with both DNA 
according to a) and RNA according to c) or with both DNA 
according to b) and RNA according to d) above* 

Preferably the nucleic acid fragment according to 
the present invention will comprise a portion of at least 30 
nucleotide bases capable of hybridising with at least one of 

a) to d) above, more preferably at least 50 such bases and 
most preferably the fragment contains a sequence of 60 bases 
exactly complementary to one of the repeat sequences of a) , 

b) c) or d) above. Other fragments of the invention may 
comprise two or more repeats of such a sequence, optional ly 
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with minor- variations by way. of substitution • Preferably 
such fragments include an integral number of such repeat 
sequences. Further fragments of the invention comprise the 
tandem repeat sequence and additional coding or non-coding 
.5' and/or 3' flanking sequences corresponding to the HPEM 
gene or a portion thereof. 

When the existence of a tandem repeat sequence was 
first identified it was believed that the sequence consisted 
of 59 base pairs corresponding with the sequences indicated 
in (a) and (b) above except for the lack of the base 
indicated with 

Many fragments according to the invention as 
originally defined in British Patent Application No, 8700269 
also conform with the new definition of fragments as set out 
herein and those fragments of sequences defined under (a) t 
(W f (c) or (d) above which do not include the bases marked. 

form a particular aspect of the present invention. Such 
fragments have sequences corresponding to at least a portion 
of the sequences 

aM GTG GGC TGG GGG GGC GGT GGA GCC 

a ' » ) CGG GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC AC 

b') DNA complementary to the sequence of a') or a''), 

c') RNA having a sequence corresponding to the sequence of 

a' ) or a' • ) and 
d*) RNA having a sequence corresponding to one of the 

coraprementary DNA sequences of b') 
— In the' human genome . the" DNA tandem "repeat sequence 
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comprises antiparallel double stranded DNA, one strand 
having sequence (a) and being paired with a strand having 

sequence (b) . 

As mentioned above the nucleic acid fragments of 
the invention may be used as a probe for detecting one or 
other strand of the DNA tandem repeat sequence in the human 
genome, or RNA transcribed from either strand and hence for 
identifying the gene or genes for human mucin core proteins, 
mRNA transcribed therefrom and complementary DNA and RNA. 
For such purposes it may be convenient to use the complete 
normal gene comprising at least one tandem repeat sequence, 
or mRNA transcribed therefrom or to attach non-complementary 
fragments to either or both the 5' and 3' ends of a fragment 
according to the invention and/or to attach detectable 
labels (such as radioisotopes, fluorescent or enzyme labels) 
to the probe or to bind the probe to a solid support. All 
of these may be achieved by conventional methods and the 
nucleic acid fragments of the invention may be produced de 
novo by conventional nucleic acid synthesis techniques. 

The nucleic acid fragments of the present 
invention may also be used in active immunisation 
techniques. In such methods the fragment codes for a 
polypeptide chain substantially identical to a portion of 
the mucin core protein and may be extended at either or both 
the 5' and 3' ends with further coding or non-coding nucleic 
acid sequences including regulatory and promoter sequences, 
mar keiL- sequences, and -spl ic-i-ng-or— 1 i-ga-t-ing— s i tes-. ~ Coding - 
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sequences may code for corresponding portions of the mucin 
core protein chain or for other polypeptide chains. The 
fragment according to the invention, together with any 
necessary or desirable flanking sequences is inserted, in an 
appropriate open reading frame register, into a suitable 
vector such. as a plasmid or a viral genome (for instance 
vaccinia virus genome) and is then expressed as a 
polypeptide product by conventional techniques. In one 
aspect the polypeptide product may be produced by culturing 
appropriate cells transformed with a vector, harvested and 
used as an immunogen to induce: active immunity against the 
mucin core protein.- In another aspect the vector, 
particularly in the form of a virus, may be directly 
innoculated into a human or animal to.be immunised. The 
vector then directs expression of the polypeptide in vivo 
and this in turn serves as an immunogen to induce active 
immunity against .the mucin core protein. 

The invention therefore provides nucleic acid 
fragments as hereinbefore defined for use in methods of 
treatment of the human or animal body by surgery or therapy 
and in diagnostic methods practised on the human or animal 
body. The invention also provides such methods for 
treatment of the human or animal body by surgery or therapy 
and diagnostic methods practised in vivo as well as ex_ vivo 
and: in- vitro. 

The invention further provides a polypeptide 
comprising a series of residues encoded by the DNA tandem 
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repeat sequence, the sequence shown at (b) above being the 
coding sequence. Polypeptides according to the invention 
are selected from any of those having 5 or more amino acid 
residues represented by the following amino acid sequence 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr 
Ala Pro Pro Ala His Gly*Val Thr Ser Ala Pro Asp Thr Arg 
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly 
("*" marks the start of the repeat in the peptide) . 

Polypeptides according to the invention may have 
a sequence corresponding with any portion of the 40 
residue sequence above and may include the start point of 
the repeat sequence. 

Other polypeptides according to the invention 
include three or more repeats of the 20 amino acid repeat 
sequence. Such polypeptides may include minor variations 
by way of substitution of individual amino acid residues. 

The invention further provides polypeptides as 
defined above modified by addition of N-acetyl 
galactosamine (a linkage sugar) on serine and/or threonine 
residues and by addition of oligosaccharide moieties to 
that or via other linkage sugars and/or fragments linked 
to carrier proteins such as keyhole limpet haemocyanin, 
albumen or thyr oglobul in . 

Preferably the polypeptide comprises at least 10 
amino acid residues of the sequence above, more preferably 
20 residues. "The polypeptide may comprise the full 
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sequence above.- Such polypeptides may further comprise 
additional conino acid . residues , preferably conforming to 
the amino acid sequence of HPEM core protein. 

In a further aspect the present invention 
provides the HPEM core protein. This is encoded by the 
PUM gene and may be produced by recombinant DNA techniques 
and expressed without glycosylation in human or non-human 
cells. Alternatively it may be obtained by stripping 
carbohydrate from native human mucin glycoprotein which 
itself may be produced by -isolation from samples of human 
tissue or body fluids or by expression and full processing 
in a human cell line. The HPEM core protein may be used 
for raising antibodies in animals for use in passive 
immunisation, diagnostic tests and tumour localisation and 
in active immunisation of humans. 

The invention further provides antibodies 
(monoclonal or polyclonal) , and fragments thereof, against 
any of the polypep.tides described above. Such antibodies 
may be obtained by conventional methods and are useful in 
diagnostic and therapeutic applications. 

The invention further provides antibodies 
(monoclonal or polyclonal), or fragments thereof, linked 
to therapeutically or diagnostically effective ligands. 
For therapeutic use of the antibodies the ligands are 
lethal agents to be delivered to cancerous breast or other 
tissue in order to incapacitate or kill transformed cells. 
Lethal- agents include ' toxins ; "radioisotopes and ^direct 
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killing agents' such as components of complement as well 
as cytotoxic or other drugs. Further therapeutic uses of 
the antibodies inclusive passive immunisation. 

The invention further provides therapeutic 
methods comprising the administration of effective 
non-toxic amounts of such antibodies or fragments thereof 
and antibodies or fragments thereof for use in therapeutic 
treatment of the human or animal body. Especially in 
therapeutic applications it may be appropriate to modify 
the antibody by coupling the Fab region thereof to the Fc 
region of antibodies derived from the species to be 
treated (e.g. such that the Fab region of mouse , monoclonal 
antibodies may be administered with a human Fc region to 
avoid immune response by a human patient) or in order to 
vary the isotype of the antibody. 

In the diagnostic field the antibodies may be 
linked to ligands such as solid supports and detectable 
labels such as enzyme labels, chromophores and 
fluorophores as well as radioisotopes and other directly 
or indirectly detectable labels. Preferably monoclonal 
antibodies or fragments thereof are used in diagnosis. 

The invention further provides a diagnostic 
assay method comprising contacting a sample suspected to 
contain abnormal human mucin glycoproteins with an 
antibody as defined above. Such methods include tumour 
localisation involving administration to the patient of 
the,_antibody- or-- f r-agment--t-her eof -bear ing- ar detectable 
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label or of an antibody or fragment thereof and, 
separately simultaneously or sequentially in either order 
a labelling entity capable of selectively binding the 
antibody or fragment thereof. The invention also provides 
antibodies or fragments thereof for use in diagnostic 
methods practised on the human or animal body. 

Particular uses of "the antibodies include 
diagnostic assays for detecting and/or assessing the 
severity of breast, ovary and lung cancers. 

Diagnostic test kits are provided for use in 
diagnostic assays and comprise antibody or a fragment 
thereof,, optionally suitable labels and other reagents 
and, especially for use in competitive assays, standard 
sera. 

The. invention will now be illustrated by the 
following Examples and with reference to the figures of 
the accompanying drawings in which 
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yirure Legends 

1 : Purification of the ailk aiuciii by iaaunoaffinity 
chroMt graphy Min^ the antibody a£FG-1. Milks from aeTeral 
isdiTiduala vere combined and abaorbed to a E^G^I-feph&roae column as 
described in Methoda, The material alutinc at low pH w iodinated 
and aubjected to PAGE alectrophoreaia and autoradiography (track 1). 
The iodinated ftaterial was precipitated naing the Protein A uethod 
with antibodies mPG-l (track 5), M^G~2 (track 2), BT254 (track 3) 
and RPiri 4 20$ PCS (track 4). 

figure 2: Coniparison of the ''^^I labelled purified xnilk aiucin with 
Icsunoblot of huican aiizcaed ailk. A, huoan aJdicBed oilk was aubjected 
to SDS polyacrylaiLide electrophoresis, transferred to nitrocellulose 
paper, the blot probed with the oonoclonal antibody H1£PG-1 and binding 
detected using an ZLISA aethod. B, after purification on an HlffG-1 
affinity colTimn followed by G75 Sephadez chroaatography the ailk aucin 
was iodinated by the Bolton and Hunter aethod and aubjected to SDS 
polyacrylaaide electrophoresia and autoradiography. 

Fi glare S; Autoradiography of the iodinated ailk aucin after treataent 
vlth hydrogen fluoride. Ihe purified ailk aneln waa traatad with HP 
for S honra at rooa taaperatva (track 1) or 1 hour at 4*C (track 2) 
and the resulting preparationa were then lodinatad and run on SDS 
polyacrylaaide c*ls* 
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^^P^^ ReactiTity of the intact, partially .stripped or extenBively 

«tripp d nilk nucin with iodinated 1 ctins. The purified intact milk 
Bucia (track 1), the. amcia treated »ith HP fer 1 hour »t 4*C (track 2) 
and the nucin treated for 5 houra at room temperature (track 3) were 
•ubjected to SDS polyacrylamido electrophoresia and then transferred 
to altrocelluloBe paper. The paper was then probed »ith ""^^I PKA 
(peanut agglutinin) , ^^^^^^ cerm a^glutLain), or ""^^I HPA 

(Helix pomatia. agglutinin). 

ripure 5: Loauncprecipitation and iffisujiohletB of the partially tjid 
eitenEively stripped cucia. A, the ""^^I extenBively. stripped mucin 
inaunoprecipiteted with EK-S (track 5), Kl£FG-2 (track 2) or KS2 
oediua as a control (track 1) hy the protein A plate method (aee 
Materials and Methods). B, the partially stripped mucin (track 1) or 
eitensively stripped mucin (track 2) was run on SDS polyacrylamide 
eels and transferred to nitrocellulose paper. The blot was then 
reacted with a cocktail of SM-S and Sf-* monoclonal antibodies and the 
binding: detected xising an ZLISA method. 

fiRure 6; Reactivity of monoclonal antibodies SV:-3 and.aa'G-2 with 
•ethacam fixed breast tissue and tumour sections using an indirect 
laaunoperoxidaae staining method. Infiltrating ductal carcinoma 
•howing strong reactirity with both SM-S (A) and HKPG-2 (B) 
yibroadenoma lOiowihf no reactirity with SU-S (C) and strong 
heterogeneous staining of the epithelium with HMPC-2 (D) . Papilloma 
•bowing Tery weak reactivity with BM-S (E) and strong positivlty with 
affG-2 (P). Both normal r sting breast (G) and lactating breast (I) 
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were negativ »hen itained with SU-S, wher as both tiBsues stained 
positively with HUPC-2 with lactating br aat (J) auch stronger than 
soraal resting br ast (H) . 
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Figure Legends 

Figure 7. Periodic acid-silver atained milk mucin after antibody 
affinity column and gel filtration column. Milk mucin vae purified on 
an HUFG-'I antibody affinity column (lane 1} followed by passage 
tbrougb a G75 Sepliadez column (lane 2} « subjected to 
HaDodSO^/polyacrylamide gel electrophoresis » and silver stained 
following treatment of gels with 0^2% periodic acid. 

Figure r . Silver stain of partially and totally stripped core 
protein from milk mucin. The purified milk mucin was deglycosylated 
by treatment with anhydrous hydrogen fluoride for 1 hr at 0*C (lane T) 
and 3 hr at room temperature (lane 2), separated by electrophoresis 
through a NaDodSO^/polyacrylamide gel (10%) and silver stained. 

Figure Q . Immunoprecipitation with MAhs of in vitro translated 
protein products from poly(A)'*" RNA. Poly{A>"*' RNA from MCP-7 

cells was translated in a rabbit reticulocyte lysate system (Amersham) 
in the presence of [^^SJmethionine (1000 Ci/mmole; ICi = S7 GBq) 
following the manufacturer's conditions. Samples containing 5x10^ 
acid precipitable cpm were precipitated with UAhs SU-4 (lane a) , Sli-3 
(lane b), HMFG-2 (lane c), HMFG-1 (lane d) and an irrelevant MAb to 
interferon (lane e, 24) p separated on a KaDodSO^/polyacrylamide gel 
(10X), impregnated with Amplify and exposed to lAil-S film at -70 'C for 
20 days . 
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Figure 10 , Imm\moblot analysis of fusion pr teins from the Xmuc 

cl nee. Tb phag clon s XUUG 3»4,6|7»6,9 and 10 were used to 

IjBOgenize bacterial strain 7 1069» lorsogens w re grown at 32*0, 

shifted to 42*0, and then induced with IFTG. I^ysogen proteins were 

fractionated by electrophoresie through a NaDodSO^/polyacrylaznide gel 

4 

(7*5%), transferred to nitrocelluloae , and reached with EU7Cj*2. The 
binding was detected with an ELISA method izsing 4-chloro-1 -aaphthol as 
the substrate. The numbers are those of the X clones, 

Figure 11 , .hybridization of pMUCIO to cDNA inserts of pMUC clones. 
DKA from the plasmid clones was digested with restriction enzyme EcoRI 
to excise the cDNA inserts, separated by electrophoresis on 1.4% 
agarose and transferred to Biodyne nylon membrane. The filter was 
hybridized using standard conditions (54) to the insert from plfUClO 
which was labelled with [a- p]dCTP by the method of random priming 
(41), Lanes: plasmid clones 3,4,6,7,6,9,10. 

Figure 12 » RNA blot hybridization analysis of mammary breast mucin 
mRNA. 10 )ig of total HNA from human breast cancer cells MCF-7 (lane 
1 ) and T47D (lane 2) , normal human mammary epithelial cells HuME (lane 
S}, human embryonic fibroblasts ICHF 23 (lane 4), Daudi cells (lane 5) 
and carcinosarcoma HS57BT cells (lane 6) were separated in a 1,3% 
agarose/glyoxal gel, blotted on to nitrocellulose and hybridized to 
the pUUCIO EcoKI insert which was labelled with [a-^^p]dCIP by the 
method of random priming (41 }• Ihe size markers were 28S (5.4 kb) and 
18S (2,1kb) rRMAs, 
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Polym rphic human DNA fragmente det ct d by 

hybridization with piiUCI 0 probe. Genomic DNA Bamples prepared from 

the white blood cells from, tea individuals (six unrelated) and from 

three cell lines were digested to completion with Hinfl and EcoRI, 

fractionated by electrphoresis through 0,1% and 0.6% agarose, 

respectively, and transferred to Biodyne nylon aeabranes. The filter 

was hybridized to the plTOCIO DNA insert ^ch was labelled with 
,52 

[o p]dCTP by the method of random priming (41). X-ray film wslb 
ezpoeed for 1 day at -70 'C with intensifying screens. Lanes 1-4 
father, two daughters and mother, lanes 5-10 unrelated individuals, 
lane IT is MC?-7, lane 12 is ZR75-1, lane 13 is I0E?-23. The DNA 
samples exhibit a wide distribution of sizes. Numbers indicate length 
of DNA in Xb. The apparent bands at 23Ej are in lanes 12 and 13 are 
artefacts introduced in autoradiography 
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Example 1 

PurificEtlon f the milk mucin 

The silk mucin vae purified from human fikimmed silk by passage 
through an HiFG-l iLffinity colmn followed by sixe excluBion 
chromatof raphy« The Hk7G«*1 sonoclonal cntibody vas purified from 
tissue cislture supernatant using a protein A column (1). The purified 
mntibody vas coupled to cyanogen bromide activated sepharofie 
(Pharmacia) as described In the manufacturer's instructione . Human 
skimmed milk vae passed in batches of 100 sD. throu|:h the antibody 
co3u=m followed by extensive washing with PBS, Bound txtigen was 
eluted from the column using 0,1 U glycine pH 2.5 and the fractions 
registering an optical density at 260nni were pooled, dialysed against 
0.25 M acetic acid and lyophilized^ Batches of about 20 mgs were 
dissolved in 0.25 M acetic acid and passed through a G75 Sephadex 
column (1 X 100 cm) which liad been previously equiligrated with acetic 
acid. The column was washed with 0.25 V acetic acid and 1ml fractions 
collected. The peak fractions which were eluted in the void volume 
were pooled, lyophilized and the dry powdor stored at Amino acid 

analysis was performed using a Beckman 6500 amino acid analyser. 

DeglvcoBylation of the milk mucin 

To remove the 0-linked carbohydrate from the milk mucin the 
Bolecule was treated with anhydrous hydrogen fluoride mm described by 
Mort and Lamport f 21) • for either 1 hour at 4*C which produced a 
partially stripped preparation, or S houra at room temperature which 
produced the extensively stripped mucin. 
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. lodinatlon of th milk miein 

lodiaatioas of the purified aucin, the partially or •xtenaively 
•tripped oucin were carried out osiag the Bolton and Hunter aethod 
(51). Briefly, the aucin, 2.5 « in 20 |a 0.1M borate buffer pH 8.5, 
was added to the dried Bolton and Hunter reaeent (T BCi,. ibaeraham 
Interaational pic) and Incubated at roon temperature for 15 Biautea. 
The reaction was atopped by the addition of 0.5 al of 0.2M glycine in 
borate buffer and after a further 15 ainutea incubation, free Bolton 
and Hunter reagent was removed by paaaage through a G25 Sephadez 
column (PD10 columns, Pharmacia) preriously oquilihriated in PBS. 

lodination of Lectins 

Wheat germ agglutinin (WGA), peanut agglutinin (PHA) (Vector labs) 
and Helii pomatia agglutinin (HPA) (Boehringer) were iodinated as 
described by ZarlBaon et al. (52) using the chloraoine T method. 

Polvacrvlaaide eels and Western blots 

Polyacrylaaide gel electrophoreaia and iamunoblotting was 
performed as described previously (1). Briefly, aanples were run on 
5-1 5% polyacrylaaide gela and then electrophoratically tranaf erred to 
aitrocelluloae paper (Schleicher and Schuell) at 50 volta overnight 
At 4*C (3Q. In the inmunoblotting axperiaenta the paper waa reacted 
with aonodonal antibodiea and binding detected with an mSA aethod 
uaiag 4-chloro-1-naphthol as the lubatrate. ?or lectin binding 
•tudies the Western blots were reacted with the Iodinated lectina as 
described by Swallow t al. 68). 
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Production of gionoclonal Btntibodies 

A female BALB/c m use was iamunized with 5 of the partially 
•tripped ailk mucin in Preund'e cooplete adjuvant and S Months later 
boosted with a further 5 |ig of the same preparation In Preund's 
Incooplete adjuvant. After a further 20 days, 5 « of the mucin 
extensively stripped of its carbohydrate was f iven intravenously in 
saline solution. Die spleen was renoved 4 days later, and fused with 
the KS2 mouse myeloma cell line (5Q) • 

Screening of hybridoma suDematant and immunonrecipi tat ions 

The screening assay was a modification- of that described by Meier o 
and Gonzalez •Rodriguez (54^. Multiwell plates were coated with 50 >il 
of 0,1 mg/ml protein A (Pharmacia Pine Chemicals) in PBS and allowed 
to dry overnight at S7*C. The plates were blocked with 5% BSA for 1 
hour at SVC followed by the addition of 50 of rabbit anti -mouse 
inmiunoglobulin (luko, diluted 1:10 in PBS/BSA = PBS/BSA). After 
incubating for 2 hours at S7*C the plates were washed twice with PBS 
containing 1% BSA and 50 fil of hybridoma supernatant added. The 
plates were incubated overnight at 4*C, washed twice with PBS/BSA and 
50 111 of iodinated partially stripped mucin containing 100,000 cpm 
added to each well. Ttxe plates were then incubated at room 
temperature for 4 hours, washed 4 times with PBS/BSA axid the 
individual wells counted in a gamma counter. Por immunoprecipitation 
•xperiments 50 jil of SDS sample buffer containing dithiothrsitol was 
added to each of the wells which were then boiled for 5 minutes and 
th buffer loaded onto 5-15% polyacrylamide gradient gels. 
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Staining of tissu a ctlong 

Tissues from priaary aammary carcinomas, b aign breaat biopsies, 
normal breast , and pregnant lactating breast tissue were fixed in 
methacarn (methanol cblorofoim and acetic acid 60:30:10) and embedded 
into paraffin wax. Sections were atained with the antibodies using 
the indirect peroxidase anti peroxidase method as previously described 
(47). 

Results 

Purification of the milk mucin 

The milk mucin was purified from human skimmed milk on an H£FG-1 
antibody affinity column. lodination of the eluted material revealed 
the presence of a large molecular weight component and a 68ZD band. 
Precipitation of the affinity purified material with antibodies HiFO-l 
and HliFG-2 (tracks 2 and 5) followed by gel electrophoresis showed 
that both the high molecular weight components and the 68ZD component 
were precipitated by both antibodies (less effectively by ia£FG-2). 
Since the 68n) component was also precipitated by two unrelated 
antibodies (figure 1^ tracks S and 4) and this component was not 
•vident on an iamunoblot of the purified material reacted with HHPG-1 
(figure 2A), the 68E component was removed by molecular aiave 
chromatography on a 075 column. The final purified product ahowed a 
major high molecular weight band, with only a trace of the 68£' 
component and a minor contaminant around 14E (figure 2B). 
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A high molecular weight glycoprotein (PAS-O) containing mor than 
50% carbohydrate in O-linkag hae b en purifi d from the huaan milk 
fat globule by Shimizu and laaauchi (8)« To see i^ether this 
component was ■imilar to the anicin isolated from milk by affinity 
chromatography on an HMFG-I affinity column, the amino acid 
compoBition of the purified HMFG-I reactive mucin was determined and 
compared to the amino acid composition of the purified PAS-O 
component. Table 1 shows that there is good correspondence between 
the two sets of data, indicating that the core proteins of PAS-O and 
the mucin purified here are the same. 

Isolation of the core "protein of the milt mucin 

As there are no enzymes easily available that are efficient at 
. removing 0-linked sugars, and p elimination often results in damage to 
the protein core, the oligosaccharides were removed by treatment of 
the mucin with anhTdrous hydrogen fluoride. This treatment hae been 
•hOTO by Mort and Lamport (21) to be effective in removing sugars from 
pig BubaaziUary mucin without damaging the protein core, ikmino acid 
analysia of the material produced after HP treatment of the milk mucin 
■uggested that the protein core was also in this ease undamaged, since 
the eompoaition was the aame as that seen in the intact mucin (Table 
1). 

Initially the milk mucin was exposed to HP for only 1 hour at 4', 
but asBlysiB of the product shewed that there was only partial removal 
of the sugara with such treatment, and it was neceaaary to treat the 
mucin at room temperatur for 8 hours to obtain a molecule which 
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allowed no lectin binding . ability . " Figure 5 Bhowe an autoradiograph of 
. the iodinated products after treatment for 1 hour at 4* (track 2) or 3 
houra at RT (track 1). It can be aeen from figure 3 that the milder 
treatment reaulta' in a mixture of products made vP of higb molecular 
veifht material vhich la aligbtly smaller than the intact mucin and a 
number of smaller band&« After longer exposure to HP at room 
temperature 9 the bigb molecular veigbt bands disappeared resulting in 
polypeptide bands of about 6 BED and 

To test for the presence of augars on the intact mucin and on the 
products produced after the two different KF treatments each 

preparation was aubj acted to acrylamide gel clectrophsreEis , 

125 

transferred to nitrocellulose paper and rcact&d vifh "I-labelled 
lectins^ The lectins used Tere peanut lectin (PNA) vKich reacts with 
galactose linked to K-acetyl galactosemine » wheat germ (WGA) reactive 
with K-acetyl glucosamine and Helix pomatia agglutinin (HPA) which 
reacts with the linkage sugar in 0-linked glycosylstion, 
K-acetylgalactoeamine, Figure 4 shows autoradiographs of the reacted 
blots, and. it can be seen that while treatment xith H? for 1 hr at 4' 
(track 2) alters the lectin reactivity of the mucin, carbohydrate is 
still .present. Interestingly, however, there is a much lower level of 
binding of PNA to the high molecular weight material of the partially 
stripped mucin than is seen with the intact mucin (track 1). 
Horeover, this loss in FKA binding ability la accompanied by binding 
of the linkage sugar specific lectin HPA. This lectin ahowa no 
binding at all to the intact mucin, and the changed pattern of lectin 
binding after limited treatment with HF indicates that sugars masking 
the 0-linked N-acetylgalactosamine have been stripped off. The 
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•mailer c mponent b en in both the intact xBucin (track 1) and in the 
partially stripped preparation (track 2) is a glycoprotein which 
reacts ilth VGA, although not vith FKA, This say correspond to the 
component of similar molecular weight (around 681) seen after affinity 
chromatography of the arucin and may represent an intermediate 
precursor molecule. 

figure 4 shows clearly that the 6BI and 721 components produced 
after extensive treatment with KP (3 hr at RT) , show no reactivity 
with the lectins (track Z) ^ including the K-&cetylgalactoeamine 
specific lectin HPA. Thie obBerration constitutes strong evidence 
that the sugars hfive been recoveafroin at least the majority of the 
molecules, and we will refer to this ?rG?£raticn as the extensively 
stripped mucin. 

Generation of monoclonal antibodies to the milk mucin core Tarotein 

A ^^asion was carried out using the spleen of a mouse that had been 
immunized with two injections of the partially stripped milk mucin 
followed by a boost with the eztenEivtiy stripped mucin, Ihe clones 
were initially screened against the ""^^I partially stripped material 
using protein A plates (see Methods). Four hybridomas were selected 
and cloned, and table 2 shows their spectrum of reactivity with the 
intact » partially and extensively stripped mucin. As can be seen from 
IMs table three of the hybridomas which were isolated ahowed a strong 
reaction with the partially and extensively stripped mucin did not 
react with the intact mucin. Ihese appeared to be good candidates for 
monoclonal antibodies to the protein core and two, SU-S and SU-4, were 
•elected to be characterised further. 
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It can clBO be. seen froa table 2 that the a£PG-1 and HUFG-2 
antibodies r acted rery atrongly lith the mucin atripped of its 

arbohydrate. Ihese two antibodi a were, in fact, develop d using the 
Intact mucin (from the milk fat globule) as iamunogen and, in the case 
of HI£P0-2, trowing mammary apithelial cells p.4V. Their reaction with 
the stripped mucin was moxpected, m» circumstantial aridenee had 
previously led to the belief that carbohydrate might fom at least 
part of their antigenic apitopes. 

Molecular Weight of noleeulea e^rr y ing antigenic deteminA»t!« 

The antibody SM-S waa ahown to be of the Ig&l aubclass, while the 
SM-4 antibody was found to be IgM. We therefore choae to use the Sli-3 
antibody in aubsequent experiments aince antibodies of the IgM class 
cam present problems in some appliction. Immunoprecipitation of the 
extenaively stripped material with SM-5 ahowed a reaction with the 
lectin unreactive 68£ component (Figure 5A, track S). The monoclonal 
antibody HMFG-2 can also be seen to immune precipitate the 
lectin-unreactive 6BK component (track 2). The antibodies were 
reactive with antigen on immunoblots and Pigure 5B ahows the reaction 
of antibody SM-5 with the dominant 68£ band of the extensively 
stripped mucin (track 2) , 

We have previously shown that the molecular weight of the 
components in breast cancer cella carrying detexainanta found on the 
milk mucin is lower than 400Z and can wary from one tumour to wiother 
(1). Reaction 6t «itibody SU-S with Weatem blots of gel separated 
•xtracta of breast tumour cella shows that this antibody reacta with 
coaponeata Of aimilar molecular weight to thoa reactive with antibody 
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HMPG-2 (data not shown). Because the antibody SM*3 differs from the 
mntibodiaa HM?G-1 and 2 In that it does not react with the Intact 
aucin procesaed by the lactating gland and yet r acta with nol cules 
proceaaed by breast cancer cells, it was appropriate to examine the 
reaction of SM-S with a range of breast cancers. 

Reactivity of SM^S with breast tigauBB and tmonrs 

Ihe antibody SU-3 reacted with paraffin wmbedded tissues provided 
these were fixed in aethacam (not formal saline). Using this aethod 
for preparation of tissue sections, the reaction of the antibody was 
compared to that of HMFG-2 on breast tissues and tumours vith an 
indirect immunoperoxidase staining method. 5Biis analysis showed a 
dramatic difference in the staining pattern of SK-S compared to that 
seen with HMFG-2. Ihus, although a strong positive reaction was seen 
in 20/22 breast cancers stained with SM-3 (as compared to 22/22 
stained with HM?G-2), normal resting breast, pregnant or lactating 
tissues and most benign lesions were largely unstained with SU^3 but 
were stained with HMFG-2. Some examples of staining patterns of breast 
tissues and tumours are illustrted in Figure 6. 

Twenty-two primary carcinomas and fourteen benign lesions were 
•xamined and the reaction of Sk-S compared to the staining with HMFG-2 
in Mch case. In the primary carcinomas, staining with SM-3 was 
heterogeneous but generally quite strong and always confined to tumour 
cells; connective tissue and stroma showed no reaction (see figures 
6A,B), In the four fibroadenomas sxamined, staining of the spithelium 
with HMP(^^ was strong although heterogeneous. In comparison, 
staining with SU-S was negative in one caae and in the three others 
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•taining was confin. d to only one or two glandular elem nte. Hl£FG-2 
•bowed Btrong positivity on the five papilloaas and five caaeB of 
cystic diaeaae atudied while th staining bsenred with was very 

■uch weaker and aore heterogeneoua (figures 6G,H). Sie papilloaas as 
- a groop showed the atrongeit staining with SM-S, and it ean be seen 
that the staining was aeabraaous or extracellular. 

In contraat to BMPS-1 and aJPG-2 which strongly stain lactating 
and pregnant breast, SM-S was totally negative with three out of six 
eases of pregnant or lactating breast (see figure 6C and D). T»o 
positive cases showed only very weak staining of an occasional cell 
and in the third, staining was confined to two areas of one lobule. 
Again, in contrast to HM?G-1 and HliFG-2 which do react with some 
terminal ductal lobular units of noraal, resting breast (albeit 
weakly), Stf-3 was totally negative on eight out of the thirteen eases 
tested and in the other five eases staining was extrenely weak and 
often confined to one or two acini in the tissue section (see figure 
6E and ?). It should perhaps be noted that the intensity of staining 
with iaiPG-2 seen with noraal breast tissues and benign lesions fixed 
in oethacani was somewhat higher than that reported previously using 
fonaalin fixed material (50,4?. 

was also shown to be negative on sections of normal liver, 
lung, thymus, sweat gland, spididymus, prostate, bladdir, small 
Intestine, large Intestine, appendix, thyroid snd skin. Xhe sntibody 
showed weak positive staining only with the distal tubules of the 
kidney, the oecaaional chief cell of the stomach, the occaaional duct 
cell of the salivary gland and the sebaceous gland. 
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DlBcusBlon 

iMTge &oI cular veight oucin nolecules ax zpressed hj aany 
carcinomas and c&n-y aany of the tumour UBOciated antigenic 
detenninantB recognised by aonoclonal antibodiea. Ibeae epitopes saj 
also be expressed by some normal epithelium , and some sionoclonal 
antibodies like Hi£FCJ-1 react particularly well with a ttucin found in 
normal human silk (1,17), As long as the study of the anicins is 
restricted to their detection with antibodies reactive with undefined 
epitopes, the knowledge of their structure, expression and procbssing 
will also be restricted, fe have begun to investigate the structure 
end expression of the c&jiciiary mucin by isolating the core protein and 
developing antibodies which have allowed us to select partial cDNA 
clones for the gene coding for the core protein • This Example 
deecribes the production end characterization of these antibodies. 

Treatment of the HlIFG-1 affinity purified milk mucin with hydrogen 
fluoride resulted in the appearance of a dominant band of about 68£ 
daltone and a minor species of about 72II) on SDS acrylamide gels* 
These bands showed no reactivity with lectins » including Helix pomatia 
agglutinin which is specific for N-acetyl galactosamine^ the first 
sugar in 0-linked glycosylation {55). It therefore seems probable 
that this 661 dalton polypeptide represents the core protein of the 
sucin« Supportive wvidence for this comes from the observation that 
the antibodies described here, which are reactive with the stripped 
661 component I can precipitate a molecule of this size from the in 
vitro translation products of &K}iA isolated from breast cancer cells 
expressing the mucin , 
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As th milk ttucin contains at 1 ast '50% carbohydrate ( 16 ) ,a 
protein core of only 68ED appears too amall if the intact aol ciale has 
an ohs IV d mol cuL'ar weight great r than 400ZD. However,, mucins can 
be coapoaed of aaall aubunits which aggregate and are held together by 
some fonn of non-coral ent Interactiona , ms yet not understood. ?or 
example , although the molecular waight of the ovine aubmaxillary aucin 
has been reported to be greater than 1x10^ daltons (45^X it has a 
protein core of only 650 amino acids with a molecxjlar weight of 58,500 
daltons ) . 

An unexpected finding was that the antibodies HffC-1 and Hl£FG-2 
which react with the. milt mucin, also show a positive reaction with 
the extensively stripped material which ahowed no lectin binding 
capability. Previous indij-ect evidence, including the resistance to 
fixation, boiling and reduction, the repetitive nature of their 
epitopes and the appearance of several bands on immunoblots, had led 
to the belief that carbohydrate present on the milk mucin was involved 
in these epitopes. This idea was reinforced by the observation that 
lectins could block the binding of a£?G*l and 2 (1 ) . While it is not 
possible to exclude the possibility that some augara, undetected by 
the lectin binding experiments, remain on the extensively stripped 
aucin described here, this is unlikely to be the explanation for the 
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r activity of the antibodiea HMFG-I and 2. This can be said since 
both antibodiea hav recently been ahown to react poaitively with 
^-galactoaidase fusion proteins eipreaaed by pha|:e carrying DNA coding 
for the core protein of the aammary ancin , It appears therefore that 
at least part of aach of the epitopes recognised by BUPG-1 and HMPG-2 
contain amino acids but it mist be assuzDod that some of these epitopes 
on the core protein are exposed, i.e. not aasked in the fully 
glycosylated molecule. She epitope is however less abundant on 

the milk mucin than the HMFG-1 epitope , while it is readily detectable 
on the mucin molecules expressed by tumours (1). These molecules have 
a smaller molecular weight and may be less heavily glycosylated or 
polymerised. 

Here we have reported the development of new antibodies lAich are 
reactive with the protein core of the mucin and with the partially 
deglycosylated molecule, but which are unreactive with the fully 
processed mucin produced by the lactating mammary gland. One of these 
antibodies which is an IgGI, has been studied in more detail. 

It has been shown to react with the mucin molecules which are produced 
by breast cancer cells and are recognised by many antibodies developed 
against the intact milk mucin. It should be smphasixed however that 
the epitope recognised by SM-3 which is on the core protein and is 
•xposed in the mucin as processed by tumour cells, is not exposed on 
the normally processed milk mucin, lliis feature offers the 
possibility of snhanced tumour specificity, and a pilot 
imaunohistochomical study of breast tumours and tissues has shown that 
indeed the SM-S antibody r acts strongly with the majority of primary 
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br ast cancers (91 J) but ahoirB littl or no reaction with benign 

br ast tmours, resting or lactating breast, and most normal tissues. 

Ihere are aeveral implications of the work described here which 
»ay be important for both basic and clinical studies in breast cancer. 
The observation that parte of the core protein (detectable by 
antibodies) are exposed on the amcins as processed by breast cancer 
but masked on the mucin as processed by cells in aoimal breast and 
benign tumoOTS implies that there is an alteration in the processing 
of the mucin in malignancy. A more detailed atudy of the processing 
of the mucin in normal and malignant cells may then give basic 
information for defining the malignant cell. Moreover, since the 
Bpecificity of the reaction of the antibody SM-3 for tumours is better 
than that of antibodies developed against the intact macin, this 
antibody may prove to be a more effective diagnostic tool for the 
detection.of breast cancer cells in tissue sections , tissue fluids and 
cells. The reactive components are membrane associated as well as 
intracellular and in vivo localisation of tumours may also be 
possible. 



wo 88/05054 



"35 



PCT/UBSS/OOOll 



AbbreviationB 

Ihe «bbreviRtionB used are: BiPO, humaa ailk fat globule; PBS, 
phosphate-buffered aalise (153 all VaCl, 5 mU ICL, 10 mM Ma-HPO , 2 mU 
^gJ'O^PH 7.4); WGA, wheat gera agglutinin; PHA, peanut «gglutinin; 
HPA, Helix ponatia agglutinin; BSA, bovine aeruB albumin; SDS, aodium 
dodeeyl sulfate. 
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Example 2 - - 

Purification and deglycosylcrtion-.of h^utian milk mucin was 
conducted as in Exairple 1 mucin was purified on an HMFG-1 
antibody. 

She stripped mucin preparations vere separated by electrophoresis 
tlirough HaDodSO^/polyacrylamide gels (10%) and silver stained by two 
methods r one of vhich can be used to stain highly glycosylated 
proteins (22,23) • 

Preparation of ■polTclonal rabbit Bntisemm to strirped core Drotein 

One Hew Zealand White rabbit was inmnnized with 100 of the 
partially stripped core protein in complete Preund^s adjuvant (Gibco), 
Booster Injections of 500 jig of the totally stripped core protein were 
administered in incomplete Preund's adjuvant (Gibco) S and 4 weeks 
after the initial injection and the rabbit was bled one week later. 
Ten microliters of immune serum (75 ^g/ml protein) precipitated 200 ng 
of fully stripped core protein in a Protein A assay (24) and detected 
it on immunoblots. The immunoglobulin fractions of rabbit preimmune 
and rabbit anti-mucin core protein were prepared by adding ammonium 
sulfate to 50% saturation. The resulting pellet was resuspended in 
one-half the original serum volume of PBS and dialyzed against the 
same buffer. JLfter dialysis , only residual precipitate was removed by 
centrifugation. immunoglobulin fractions were stored in aliquots at 
-20 'C. 
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Descri'ption of MAba used 

In addition to the polyclonal antiserum used for initial screening, 
a cocktail of two MAbs, SM-3 ^ ^-^^^ ^^^"^ recognise 

■uein core protein (20) and HUFG-I aiid (1.14) v«r» osad to 

screen the purified plaques, the p-galaetoiidase fusion proteins and 
for Immunoprecipitatioaa froa in vitro translated proteins*. Other 
MAbs used were a Bonodonal snti-^-g^^tosidase antibody (25) vhieh 
vas a gift fros H. Durbin (ICR?, Lozidon), an snti-interferon antibody^ 
ST254 (24) t a keratin antibody (26) and U1 8 vhich recognizes a 

carbohydrate structure on the silk mucin (27). 

In Vitro translation of proteins 

RKA was isolated from the human breast cancer cell line llCF-7 
using the guanidium isothiocyanate method of Chirgwin et al. (28). and 
poly(A)* RHA was purified by chromatography using oligo (dT)«celluloBe 
(K9W England 5io Labs). The poly(A)'*' KSA was translated in a 
reticulocyte lysate system (Amersham) in the presence of [^^S] 
methionine (1000 Ci/omole; 1 Ci 37 GBq, Amersham). Samples 
containing 5x10^ acid insoluble epm were precipitated in a protein A 
assay (24) using UAbs KUPG-1 , HI£7G«2 and a control 

antibody to human interferon. The antibody«« elected proteins were 
then separated on a 10% HaDodSO^/polyacrylamide gel. impregnated with 
Amplify (Amersham) and exposed to XAR-5 film (Kodak) at -70 *C. 

*The l£AbB SU^ and SU**4 (SU refers to stripped mucin) show strong 
reactivity with the partially and fully stripped core protein but no 
reactivity with the fully glycosylated mucin (20). 
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Antibody sere nine of Xgt11 librarT and vrot in blotting 

The Igtll library iieed in this study was constructed from inRNA 
isolated from the human breast cancer cell line MCF^l g rr^ was 
generously provided by Philippe falter and Pierre Chambon (Strasboxu^g, 
France). She poly (A)"^ EHA used for the preparation of the randomly 
primed library was prepared from mEKA that sedimented faster than 28S 
rRNA and vas enriched in estrogen receptor (29). The library was made 
esflentially as described by Huynh et al. and loung and Dayis (30-S2) 
and contained approximately 1 x 10^ recombinants per |ig of RNA. 
Between 85% and 95% of the plaques contained inserts. 

The phage library was plated onto bacterial strain Y1090 and grown 
for S hr at 42 'C. After isopropyl p-D-thiogalactoside (IPTG) 
induction and S hr of growth at 37*0, filters were prepared from each 
plate and screened with anti-mucin core protein antibody by the method 
of loung and Davis (32).. The first antibody used in acreening was the 
rabbit antiserum raised against the stripped core protein prepared as 
described above. Prior to use in acreening, the antiserum was diluted 
1:200 in PBS containing 1% bovine serum albumin (PBS/BSA). 
Preabsorption with T1090 bacterial lysate was not found to be 
necessary. The nitrocellulose filters (Schleicher and Schuell) were 
blocked by incubation in PBS containing 5% BSA for 1 hr at room 
temperature with gentle agitation. The filters were incubated at room 
temperature overnight with a 1 :200 dilution of antiaerua in heat 
sealed plastic bags. The filters were washed 5x5 min in PBS/BSA, 
and bound antibody was detected by using horaeradiah 
peroxidase-conjugated she p anti-rabbit antiserum (Dako) diluted 1:500 
with PBS/BSA and incubated_ for 2_ hr with th_ f ilt_erB . _ Th filters 
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were vasbed 5x5 ain in FBS/BSA and 1 z 10 nin in PBS before color 

detection using 4-cblor -1-napbthol (1). Inmun reactiv bacteriopbage 

were picked and purified tbrougb two additional rounds of screening. 

Subsequently I bacteriophage Inserts vere sxibcloned into the £coKI 

sites of pUC8 (33) producing tbe plasmid used most extensively » pMUC 

10* Xhe plasfflids were maintained in I3H1 cells. 

To ezaaine the PHS^&ctosidase-cDKA fiision proteins for 

imaimoreactivity, cell lysates were derived. I^rsogens were prepared 

as described in Toung and Davis (34), Cells were pelleted, suspended 

in Laeomli sample btiffer (35) and separated by electrophoresis through 

MaDodSO /polyacrylamide gels (10%) and transferred onto nitrocellulose 
4 

filters as described (1,36). The filters were treated as above for 
antibody screening. 

Korthem Analysis 

RNA was isolated from tissue culture cells and frozen tissues by 
the guanidinium isothiocyanate method of Chirgwin et al. (28). Total 
ENA (10 |ig per lane) was denatured by heating at 55*C for 1 hr in 
deionised glyozal and fractionated by electrophoresis through a 1.3% 
glyozal gel (56}. The HKA was transferred to nitrocellulose 
(Schleicher and Schuell)t prehybridired and hybridized as described by 
Thomas (S4). Filters were washed down to 0.1Z SSC with 0.1^ SDS at 
65 *C and exposed to ZAB*5 film (Kodak) at -70 *C with intensifying 
screens. 
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Southern analysis 

High molecular weight genomic BNA was prepared from white ^lood 
cells and cell lines (39»40}. These genomic DNAs (lO^g) were cleaved 
with restriction enzTmes following the manufacturer's recommended 
conditions and fractioned through 0.6% and 0,1% agarose gels. Cloned 
plasmid DNA was cleaved and fractionated on.1«S% agarose. Ihe gels 
were denatured « neutralized and transferred to nylon nembranes 
(Biodyne) according to the manufacturer's instructions. The £coR1 
insert from pMUC1 0 was separated on a 1% low melting point agarose 
(Biorad) gel. arid labelled with [o- P]dCTP by the method of random 
priming (41) end hybridized to' filters at 42 *C. Filters were washed 
down to 0.12 SSC with 0.t% SDS at 55 'C and exposed to IAR-5 film 
(Xodak) at. -70*C with intensifying screens. 



Results . 

Purification and deglycosylation of mucin glycoprotein 

Ifucin glycoprotein reactive with the monoclonal antibody HU7G-1 
was prepared from pooled human breast milk by using an 1Q£7&<»1 antibody 
affinity column , followed by molecular sieve chromatography on 
Sephadex 0-75 ia order to remove lower molecular weight components 
(Figure 7, lane 1). In order to . demonstrate the homogeneity of the 
purified molecule, amino acid analyses of four separate preparations 
were performed and revealed a fairly consistent composition with 
serine, threonine, proline, alanine and glycine accounting for 56% of 
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the amino acids 



Periodic acid silver stained gels 



revealed a diffus band of gr ater than 400,000 daltons visible only 
when the gel was treat d with periodic acid before th silver stain 



Tisualired on the gel using the silver stain without prior treatment 
with periodic acid. 

The purified material was subjected to treatment with hydrogen 
fluoride to remove the 0-linked sugars that are characteristic of 
mucin glycoproteins. Two different reaction conditions were used 
which resulted in a partially deglycosylated core protein (treated at 
0*C for 1 hr) and a fully deglycosylated core protein (treated at room 
temperature for 3 hr) as determined by iodinated lectin binding 
following separation by gel electrophoresis and transfer to 
nitrocellulose paper (20). Xh© partially deglycosylated core protein 
was reactive with wheat germ agglutinin, peanut agglutinin and helix 
pommatia lectin (which recognizes the lin k age sugar 
K-acetylgalactosamine) whereas the fully stripped protein showed no 
reactivity with any of these three lectins. 

The hydrogen fluoride treated core protein was separated by 
electrophoresis through NaBodSO^/polyacrylamide gels (10%) and silver 
stained. Silver staining revealed that the predominant component of 
the partially stripped mucin was a high molecular weight band of about 
400 kd, although faint bands of lower molecular weight could also be 
observed (?ig«8 , Isne 1). Since the high molecular weight material 
showed a somewhat increased mobility in the gel and reacted with the 
lectin recognising the linkage sugar, it can be assumed that some 
sugars had been remov d. The fully stripp d mucin consisted of two 
bands of abo\rt 68 kd "and' 72 kd (Pig. lane 2). " ~ - - 



(Pig. 7f 1*111® 2). Ho other lower molecular weight bands were 
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Antibody reactive proteins produced by MCF*7 cells . 

The i(CF-7 breast cancer cell line expresses large amounts of 
HUFG^t and reactive material on its cell surface (14) and was thus 
judged to be a suitable source of &KKA for a cDNA library. Before 
proceeding to screen the UCF*7 library with the monoclonal antibodies, 
they were tested for their ability to precipitate a component from in 
vitro translation products produced from MCP-7 mRNA. Poly (A)"*" RNA 
from MCF-7 was prepared and translated in vitro . Proteins from the 
translation reaction were immunoprecipitated using the monoclonal 
antibodies HMFG»1» HMFG-2; SM-3 and SU-4r and displayed by 
polyacrylamide gel electrophoresis and fluorography (Fig, 9), Two 
proteins of about 68 kd and 92 kd were immunoprecipitated by SM-3 
(lane 2} and SU-4 (lane 1}« It was also found that HUFG-1 (lane 4) 
and HMFG-2 (lane 3} immunoprecipitated these proteins; however ^ no 
bands in these areas were precipitated by an irrelevant monoclonal 
antibody to himan interferon (lane 5)« The fact that .HMFG-1 and -2 
immunoprecipitated these proteins was an unexpected finding as it was 
previously thouglit that these KAbs recognize carbohydrate 
determinanta (1). However, we also found that HUFG-1 and -2 react 
Tery atrongly with the fully stripped, iodinated core protein (20), 
These results together with the MAb reactions on the p^alactosidase 
fusion proteins (see below} confirm that the epitopes for BUF0*1 azid 
'-'Z are, at least in part, protein in nature. 

The abundance of the core protein mRNA in total cellular poly (A)"*" 
RNA was 4% as stimated by comparing th amount of ('^S)methionine 
present' as ~immunopi*ecipitated protein to th amotmt of methionine 
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incorporated into total prot in during in vitro tranalati n, 

Scr ening of the cDNA library 

The Xgt11 cWA library aade from aize eelected llCF-7 &RNA (aee 
Methods) was screened initially vith the polyclonal antiaerum sade to 

the Boicin core protein which had been stripped of its carbohydrate. 

g 

Screening of 2 z 10 plaques resulted in 11 positive clones ^ 7 of 
which were taken successfully through two further rounds of plaque 
purification. 

To demonstrate that the reactivity of the phage clones with the 
antibody probes was due to antigenic deteiminants on the cDMA 
translation product, p-galactosidase fusion proteins were made from 
all 7 clones. The proteins were separated by electrophoresis , 
transferred to nitrocellulose paper and probed with a variety of 
antibodies to the stripped mucin , including the polyclonal antiserum 
which was used initially to select the clones and a cocktail o'f 
and SM-4. In addition, HMFG-1 and HMFG-2t the two monoclonal 
antibodies which originally detected this differentiation and 
tumour-associated epithelial mucin (1»14} were tested. ^1 7 clones 
yielded fusion proteins i^ich were specifically recognized by the 
polyclonal antiserum, the monoclonal cocktail, and EUFG-1 
antibody reacted with 6 of the 7 fusion proteins and failed to 
recognize the protein from clone 9 which contains the amallest insert. 
In every case the strongest signal was given by the HU?Cr-2 antibody 
and this reaction ia shown in Figure lo. Monoclonal antibodies to 
keratins and to a carbohydrate epitope on this fully glycosylated 
mucin w re used as controls and show d no reactivity 
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A monoclonal antibody to ^-galactosidase vas a positiv control and 
the band recognized correlated in every case vith the band recognized 
by the "specif ic antibodies. The sizes of the fusion proteins varied 
in proportion to the sizes of the cDNA inserts found in the 
bacteriophage, 

Gharacterization of cDNAs and RNA blot analysis 

The. inserts from the X clones were designated pUUC3-1 0 (omitting 
pMUCS) and were subcloned into the vector pUC 6 for easier 
manipulation* The 7 clones were compared to each other for sequence 
homology. Each of the plasmids was digested with EcoRI and the insert 
separated on a 1.4% agarose gel. The largest cDKA insert from 'pW}O^Q 
was used to probe the inserts and found to hybridize to all 6 inserts 

(Pig .11). pMUC 7 was found to contain two inserts following digestion 
with EcoRI; however, only 1 of the inserts hybridized to the pMUCiO 
probe , The insert bands were not derived from phage DNA since the 
pMUCI 0 probe did not hybridize to Hind Ill-digested X phage DNA 

As aliown by agarose gel electrophoresis (Pig.H), the 
inserts vary in size from about 200 to up to about 1800 bp. ^e 
largest insert from pMUCIO has been used as the hybridization probe in 
all subsequent experiments. 

Because the IlfUC clones were identified only by antibody binding » 
we needed additional assurajice that they were indeed coding for the 
breast epithelial mucin. To determine the authenticity of pUUCIO, we 
correlated the presence of mRNA hybridizing to the clone with mucin 
expression in various cell lines. As shown in figure 12ithe cDNA 
hybridized to two transcripts f 4,7 kb and 6.4 kb in th RNA from the 
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br ast cane r cell lines MCP-7 and T47D which wer ahown pr viously to 

xpr as th Hl£FG-2 antigen (1,14). Significantly, th pMUCI 0 prob 
hybridized to transcripts of approximately the same six in Rl^A 
extracted from normal aasmary epithelial cells cult-ored from milk 
(42). A third band of 5.7^ can be seen in the RKA from these normal 
cells. In contrast, three human cell types that lack the mucin, 
breast fibroblasts, Baudi calls and HS576T, a carcinosarcoma line 
derived from breast tissue (43), showed no detectable pMUCI O-related 
mMA. The 6.4 Icb band appears to be the most adundantly expressed. 
The presence of at least two sizes of mRJIA from l£CF-7 cells correlates 
with the immionoprecipitation of two proteins of (molecular weights 68 
kd and 92 kd) from in vitro translated mRKA from MCF-7 cells. The 
normal mammary epithelial cells were derived from pooled milk samples 
and the additional transcript observed may be due to poljmorphismB 
among individuals . 

Genomic DNA blot hybridization and detection of a restriction fragment 
length polymorphism (RFLP) 

Genomic DNA was prepared from a panel of ten individuals 
consisting of six unrelated individuals and a family of four, and from 
three cell lines. The DNAs which were digested with Hinfl or EcoRI 
and blotted and hybridized to the radiolabelled pUUCMO insert, exhibit 
restriction fragment length polymorphisms. !Die restriction fragments 
from the ten individuals and three cell lines are shown in figure 13, 
The pattern consists of either a single band or a doublet of sizes 
ranging from S400bp to 6200bp in the Hinfl digest (with the exception 
of the ZR75-1 DNA in lane 12, figure 13a which shows thre bands) or 
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from 8200bp to 9600bp in the EcoRI digest (Figure 13 B ). There appears 
to be a continuous distribution of th fragment nizes which implies a. 
high in vivo instability at the locus. The pattern of fragments 
observed in the family of four (lanes 1-4) suggests that these 
fragments are allelic. Preliminary atudies of the DMA made from white 
blood cells of normal, related individuals Indicate the existence of a 
number of independent alleles with an autosomal codominant mode of 
inheritance These 
studies will be the subject of a separate investigation. 

• Discussion 

The cDNA clones- described here which were obtained from the MCF-7 
Igtll library were selected using polyclonal and monoclonal antibodies 
prepared against a normal cellular product, the. milk mucin in its 
deglycosylated fonn. This was done because it was easier to obtain 
large quantities of the mucin for stripping than to prepare similar 
quantities of immunologically related glycoproteins expressed by 
breast cancer cells (44). The fact that the antibodies did select for 
cDNA coding for nonglycosylated core protein molecules in MCP-7 cells, 
strongly suggests that the glycoproteins in these cells , which were 
originally detected by their reaction with antibodies to the milk 
mucin, contain the same core protein as this oucin. This is confirmed 
by the detection of nRHAs of approximately the same sizes in the 
nonsal and malignant cells, using one of the probes isolated from the 
MCP-.7 library. W will therefore ref r to the antibody reactive 
glycoprot ins on breast cancer c lis as mucins, bearing in mind that 
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th ir pr ceasing may be diff r nt resulting in mol ules of different ' 
molecular w igbts but irith th sam core protein as that of the milk 
mucin* 

Seven clones were obtained from the MC7-7 library of which the 
largest was IBOOkb, This clone cross hybridized with the other 6 
Asialler clones. She ^-galactosidase fusion proteins expressed by six 
of the cross-hybridizing lambda clones were reactive with the 
polyclonal antiserum directed against the mucin core protein as well 
as with four well-characterized monoclonal antibodies directed to 
various epitopes on the stripped core protein, SU-3f HU7G-1 and 

(149 20)* The smalleet lambda clone » XMUC9« produced a 
p-'galactosidase fusion protein which reacted with three of the four 
monoclonal antibodies and with the polyclonal antiserum. 

The surprising result that the extensively characterized HMFG-I 
and HMFG-2 monoclonal antibodies reacted strongly with the lambda 
plagues and the fusion proteins and could immtmoprecipitate proteins 
from in vitro translated mHNA provides strong evidence that these 
clones do indeed code for a portion of the mucin core protein. 
Although previous evidence such as resistance to fixation , boiling, 
treatment with dithiothreitol and NaDodSO^ and the presence of 
multiple epitopes on the molecule suggested that these were 
carbohydrate (1 ) » it has now been established that the epitopes of the 
HU?G-1 and HUPG-2 monoclonal antibodies are definitely protein in 
nature. Carbohydrate may be required to obtain the, strongest binding, 
•ither as part of the epitope or by conferring some conformational 
change on the protein portion, but part of the antigenic determinant 
must consist of an amino acid sequence. Since these two MAbe are 
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r active with the fully glycosylated milk nucin as well as the 
stripped core protein, this data means that the intact molecule 
contains areas of naked peptide which contribute to the antigenic 
sites for these two antibodies. 

Confiimatory evidence that pMUCIO codes for the mammary mucin core 
protein is provided by RMA blots. Ihe relative abundance of mRMA in 
the breast cancer cell lines MC7-7, 2R-75-1 and in normal 

mammary epithelial cells corresponds to the antigen expression by 
these cells as measured by the binding of the HM?G-l and aiPG-2 
monoclonal antibodies. Cell, types which are negative for antigen 
expression such as human fibroblasts, Daudi cells and HS578T, a 
carcinosarcoma line derived from breast (14), are negative in RHA blot 
hybridizations* A fortuituous observation made with the ZR-75-1 
cells yielded indirect strong evidence that pMUCIO does indeed code 
for the mucin glycoprotein core protein. This cell line, which 
routinely expresses large amounts both of mRHA and antigen, yielded 
one preparation of MA which was unexpectedly negative by blot 
hybridization. It was subsequently found that those particular 
ZIi-75-1 cells from which the RMA had been made had lost the expression 
of the antigen as well at this time (as determined by reaction with 
aiPC-1 and 2). Different passage numbers of the ZR-75-1 cells were 
recovered and. shown once again to express both antigen snd message. 
The sixes cf the messages. 4.7 J± and 6.4 kb, «^ quite large, since a 
68 kd or 92 Xd proteiii would need only about S kb to code for the 
protein portions. Ihis suggests that a large portion of the nRMA 

maybe untranslated. Efforts are underway to obtain a full-length 
clone. 
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Hius, the cDKA clones presented here represent a portion of the 
gene coding for the human mnmmary mucin vhich is zpresaed by 
differentiated breast tissue as veil as by most breast cancers. The 
sajor proteins precipitated from in vitro translation products of RKA 
from UCF*7 cells by antibodies to the milk mucin core protein (68£d) 
have an apparent solecular weight of 6B£d and 92£d* These proteins i 
produced by the breast cancer cell therefore share epitopes with the 
6B£d core protein of the milk mucin (20). Whether a similar 02£d 
protein is also produced by normal mammary epithelial cells , and is 
truncated or destroyed by HF treatment is not yet dear. UCF-7 cells 
biosynthetically labelled with 14C amino acids yield upon 
immunoprecipitation with HMFG-1 and Hli£FG-2 antibodies, two 
glycosylated proteins of 320 kd and 430 kd, and it 

is possible that each of these glycoproteins utilizes only one core 
protein of either 68£d or 92Id. Alternatively, each of the 
glycoproteins could contain both the 92£d and 6B£d proteins either in 
different proportions or variably glycosylated. Further screening of 
the library may yield full length cDNAs coding for both sizes of the 
immunologically related core proteins. Since there appears to be only 
a single gene (based on Southern blot data obtained by using a partial 
cPNA probe} p it is probable that the multiple messages arise by 
alternative EMA splicing and this would explain the fact that they 
contain common sequences. Although a core protein of 58 kd appears to 
be small to yield a fully glycosylated molecule of greater than SOO kd 
which contains 50% carbohydrate » there is evidence that such a 
structure for mucins is possible. Ovine submaxillary mucin has a 
report d molecular weight f 1 x 10^ daltons (45), y t its protein 
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core consists of 650 amino acids resulting in a molecule of 58 kd 
(46). 

The mucins whict are detected with HMPG-1 and HM?G-2 ICAbs on 
immunoblots of tumours and breast cancer cell lines show Tariatione in 
■ize from 80 fcd to 400 kd in the molecular weights of the tumour mucin 
molecules (1,47). Using these same antibodies which detect high 
molecular weight mucins present in normal wine, a polymorphism has 
indeed been ahoTO to be genetically determined (48). ilthough the 
Tery low molecular weight components are likely to represent precursor 
fotos of the mucin which appears to be incompletely processed in many 
tumour cells (20), the vapiations in the higher molecular weight 
components are likely to be due to this genetic polymorphism. It was 
unclear, however, liiether the structural basis of the polymorphism was 
due to the -genetically determined protein or to the carbohydrate 
portion of the mucin. The detection of restriction fragment length 
polymorphisms in the Southern blotting experiments using the mucin 
probe suggest that the mucin polymorphism occurs at the level of the 
DNA which codes for the protein. Preliminary sequence data 
yfeuggest that the basis for this polymorphism is a region 
of variable tandem repeats present in the protein coding sequences. 
This structural feature may be responsible for the generation of the 
many allelic restriction fragments at the mucin locus. We are 
presently investigating the basis of the mucin polymorphism by a 
Southern blot surrey of DHA from white blood cells of normal, related 
individuals whose Inheritance pattern of urinary mucins has been 
determined. In addition, w are examining DNA preparations made from 
. J^*L*.* J*''"°°i cel ls sn d tumours, of indi vidual breast cancer patientB_ 



SUBSTITUTE SHEET 



wo 88/05054 



51. 



PCT/GB88/00011 



to deteraise if there is any diecordance between genotype in the 
pair d samples, since tandemly repeated DNA may provid an unstable 
sit vhere recombination or amplification could occur. 

The presence of mucins in the majority of carcinomas and their 
association vith the differentiation of mammary epithelial cells makes 
it particularly important to identify regions involved in the tissue 
specific and developmental regulation of the gene. Moreover, the 
introduction of a functional mucin gene into cells should provide 
insights into the role of this molecule in breast epithelial 
differentiation and possibly enable us to identify any alterations in 
the function or expression of the mucin which are related to malignant 
transformation in the human breast « 
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AbbreviationB 

The abbreviationB are as follows: PBS, pbospbate-buffered aalxne; 
mb, aonodonal antibody ;.IPTG. iaopropyl p-D-thiogalactoside; bp, 
base pair(8); Sb, kilobaBe(8). 



SUBSTITUTE SHEET 



wo 88/05054 PCr/GB88/000ii 

-53- 

lABLE 1 

Anlno acid eoapoBltion of the human milt nuein - eonrparlBon with PAS-O 



ABino acid HM7G-1 purified fetaaaively stripped PAS-0 

■ilk Bucia Bilk Bueia (Shlaira ft laaauchi 

1982) 



Asp 6.1 7.2 

Ihr 9.4 B.7 

Ser 9.1 13.0 

Glx 6.3 9.6 

Pro 14.8 14.4 

'^y 8.1 10.1 

Ala 12.3 11.9 

Cys Hot analysed ITot analysed 

Tal 6.0 6.S 

Met 0.5 0.4 

Il« 1.6 1.7 

4.5 4.8 

lyr 2.0 0.9 

H»« 2,0 1.6 

Hia S.2 2.5 

2.8 S.S 

Arg 4.0 4.0 



6.4 
9.8 

15.1 
8.3 

12.0 

12.2 

15.0 
0.5 
5.3 
0.8 
1.9 
5.7 

1.6 

1.7 

5.8 

2.2 

5.9 
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Table g 

R activity of the antlbodieB en In t act. T.arti«l.iv and totally 
deglycoBTlat d milt nuein 



125 



I Cfm bound 



iatibody 


Intact Boleeule 


Partially 


Totally 








M -t- ^ ■■ ■ 1 1 |1 • 

■xrijipea amcln 


5.17 


8,524 


11,925 


5,780 


9.15 


525 


3, 000 


3,328 


8U-3 


465 


15,414 


9,200 


SlI-4 


81 6 


16,750 


9,561 


HU7G.1 


32.000 


33,768 


9,494 




29,500 


29,230 


15,832 


NS2 aedium 


397 


845 


650 



The binding of the uitibodies to iodinated intact, partially and 
totally deelycosylated milk aracin was assayed using the protein A 
plate aethod as described in Materials and Methods. 
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CLAIMS 

1. An antibody or fragment thereof against a 
human mucin core protein which antibody or fragment has 
reduced or substantially no reaction with fully expressed 
human mucin glycoprotein. 

2. Human polymorphic epithelial mucin core 

protein, 

3. A polypeptide comprising 5 or more amino 
acid residues in a sequence corresponding to the sequence 
(I) 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala 
Pro Gly Ser Thr Ala Pro Pro Ala His Gly 

4. A polypeptide according to claim 3 having 20 
or more amino acid residues in a sequence corresponding to 
the sequence (I) 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala 
Pro Pro, Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala 
Pro Gly Ser Thr Ala Pro Pro Ala His Gly 

(I) 

5. A polypeptide according to claim 3 or claim 
4 wherein at least one amino acid residue bears a linkage 
sugar substituent. 

6". A~porypepti~de~a~ccord^ to claim 5 wherein 
the' linkage sugar bears an oligosaccharide moiety. 
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7. A polypeptide according to claim 5 or claim 
6 wherein amino acid bearing a substituent is a serine or 
threonine and the linkage sugar is N-acetyl galactosamine. 

8. A polypeptide according to any one of claims 
5 to 7 linked to a carrier protein. 

9. An antibody or fragment thereof against a 
polypeptide according to any one of claims 3 to 8 which 
antibody or fragment has reduced or substantially no 
reaction with fully processed human mucin glycoprotein, 

10. An antibody or fragment thereof according to 
claim 1 or claim 9 against a human polymorphic epithelial 
mucin core protein. 

11. An antibody or fragment thereof according to 
claim 10 against human polymorphic epithelial mucin core 
protein as expressed by a human colon, lung, ovary or 
breast carcinoma. 

12. An antibody or fragment thereof according to 
any one of claims 1 and 9 to 11 which has no significant 
reaction with mucin glycoprotein expressed by pregnant or 
lactating human mammary epithelial tissue. 

13. A monoclonal antibody or fragment thereof 
according to any one of claims 1 and 9 to 12. 

14. A hybridoma cell capable of secreting a 
monoclonal ant ibody according to claim 13. 

15. A hybridoma cell of the cell line designated 
HSM3 (ECACC 87010701) . 

- - - — 16. " A'mbnbclohal antibody "secreted by HSM3 
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(ECACC 87010701) . 

17. A nucleic acid fragment comprising at least 
17 nucleotide bases the fragment being hybridisable with at 
least one of 

a) the DNA . sequence 

5' * 
ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 

GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC- 

* 

ACC GTG GGC TGG GGG GGC GGT GGA GCC CGG- 

3" 

GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC 

b) DNA of sequence 

5' 

GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 

* - 

CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT- 

GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 

* " 3' 
CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT 

c) . RNA having a sequence corresponding to the DNA sequence 

of^ -a^— and — 

d) RNA having a sequence corresponding to the 
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DNA sequence of b) . 

18. A nucleic acid fragment according to claim 
17 comprising a portion of at least 30 nucleotide bases 
capable of hybridising with at least one of sequences (a) 
to (d) • 

19. A DNA fragment according to claim 17 or 18. 

20. A double stranded DNA fragment comprising 
antiparallel paired portions having respectively sequences 
(a) and (b) as defined in claim 17. 

21. An antibody or fragment thereof according to 
any one of claims 1, 9 to 13 and 16 bearing a detectable 
label or a therapeutically or diagnostically effective 
moiety. 

22. An antibody or fragment thereof according to 
any one of claims 1, 9 to 13, 16 and 21 for use in a method 
of therapy or diagnosis practised on the human or animal 
body. 

23. Human polymorphic epithelial mucin core 
protein bearing a detectable label or a therapeutically or 
diagnostically effective moiety. 

24. Human polymorphic epithelial mucin core 
protein according to claim 2 or claim 23 for use in a 
method of therapy or diagnosis practised on the human or 
animal body. 

25. A polypeptide according to any one of claims 
3 to 8 bearing a detectable label or a therapeutically or 
diagnostically effective moiety. 

substjtutl: a^EST 
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26. A polypeptide according to any one of claims 
3 to 8 and 25 foe. use in a method of. therapy or diagnosis 
practised on the human or animal body, 

27. A. nucleic acid fragment according to any one 
of claims 17 to 20 bearing a detectable label or a 
therapeutically or diagnostically effective moiety. 

28. A nucleic acid fragment according to any one 
of. claims 17 to 20 and 27 for use in a method of therapy or 
diagnosis practised on the human or animal body. 

29. An assay method comprising contacting a 
sample suspected to contain abnormal human mucin 
glycoproteins with an antibody or fragment thereof 
according to any one of claims 1, 9 to 13, 16 and 21. 

30. A diagnostic or therapeutic method practised 
on the human or animal body comprising administering an 
antibody or fragment thereof according to any one of claims 
Ir 9 to 13, 16, 21 and 22. 

31. - A diagnostic or therapeutic method 
practised on the human or animal body comprising 
administering human polymorphic epithelial mucin core 
protein according to any one of claims 2, 23 or 24. 

32. A diagnostic or therapeutic method 
practised on the human or animal body comprising 
administering a polypeptide according to any one of claims 
3 to 8r 25 and 26. 

33. A diagnostic or therapeutic method 

practised on. the -human or animal body comprising 

administering a nucleic acid fragment according to any one 
of claims 17 to 20, 27 and 2B^ 
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